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INTRODUCTION 

Antibody (Ab) affinity maturation (AAM) 
referred originally to the observed increase 
in average Ab affinity against a hapten (1). 
Later, it was found that AAM is associated 
with the formation of transient lymphoid 
structures in the B cell zones of lymphoid 
tissues, called germinal centers (GC), dur- 
ing T-cell dependent immune responses in 
higher vertebrates (2). 

In another line of research, AAM was 
related to the occurrence of mutations in 
the variable ( V) domain of Ab heavy (H) 
and light (L) chains, respectively, Vh and 
Vl. In those works, a mutational analysis 
of Ab V genes was performed, initially on 
bulk splenic plasma B cells and later on GC 
B cells vs. extrafollicular B cells, after suc- 
cessive immunizations. The results showed 
typically an increased number of mutated 
GC B cells (3-6), and an accumulation of 
mutations per Ab chain during the ongoing 
immune response, with many mutated B 
cells displaying higher affinity for the hap- 
ten used for immunization. This provided 
strong support to a previously suggested 
concept (7), according to which AAM is 
a B-cell receptor (BCR)-based Darwinian 
evolutionary process. 

A few years later, two complementary 
hypotheses were proposed. The first one, 
based on a mathematical model, suggested 
that, for the fastest production of high 
affinity Abs, the mutation rate in GC B 
cells should be minimal before GCs reach 
a threshold size, and then switch abruptly 



to the maximal possible rate (8). The sec- 
ond hypothesis proposed, for the assumed 
Darwinian process, alternating cycles of B 
cell proliferation plus mutation plus selec- 
tion (9). These ideas were soon extended in 
another modeling work, showing that Ab 
affinity can be maximized when the muta- 
tional mechanism switches on and off reg- 
ularly (10). These results contributed con- 
siderably to strengthen the general belief 
in the recycling or multiple-step selec- 
tion hypothesis. On the other hand, more 
recently, alternative B cell selection mech- 
anisms were proposed that do not require 
multiple-step selection in order to be com- 
patible with observed levels of Ab affin- 
ity increase during a primary immune 
response (11, 12). 

There is still much to learn about AAM 
mechanisms (13-17), and there is a need 
to clarify some aspects of the GC phys- 
iology where overinterpretation and pre- 
conceptions prevail (18, 19). The multiple- 
step selection hypothesis is a prominent 
example of a concept that, having impor- 
tant basic and practical implications, has 
never been confirmed. Clearly, a direct 
way to establish it would be to observe 
multiple BCR-mediated selection events 
by tracking individual B cells via imag- 
ing of lymphatic tissue, observing SHM 
taking place between selection rounds. 
However, direct observation of even one 
selection event is not yet possible. At the 
same time, attempts to interpret indi- 
rect data must be faulty due to the need 



to use unverified assumptions on AAM 
mechanisms. 

Therefore, we take here a radically dif- 
ferent approach: we propose to consider the 
single-step selection concept to be a null- 
hypothesis which should be attempted to 
be falsified (Figure 1). Because this ansatz 
puts the focus on a process of random non- 
directed acquisition of mutations, it mini- 
mizes the need for unverified assumptions. 
And because mutations carry the signature 
of the selection process, the data to be used 
should consist of Ab V gene sequences. 
In the following, we examine two possible 
falsifying strategies. 

FALSIFYING THE NULL-HYPOTHESIS 
WITH PHYLOGENETIC TREES 

Let us consider all mutated Vh or Vl 
sequences belonging to a given B cell lin- 
eage. The corresponding phylogenetic tree 
is a result of the evolutionary process 
undergone by the initial sequence, and as 
such, is shaped by the various factors per- 
taining to the affinity maturation process. 

Extensive work was performed on devel- 
oping methods to build phylogenetic trees 
from V genes of a common lineage (20) 
and to analyze how shape measures depend 
on AAM mechanisms (21, 22). These simu- 
lations show that the tree shapes vary most 
on the initial clone affinity and the selection 
threshold, and much less in dependence on 
the rates of GC B cell recycling (22), not 
allowing for a unique mapping from tree 
shapes to selection mechanisms - likely 
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FIGURE 1 | Sketch of proliferation plus SHM and selection history of 
a plasma or memory cell. (A) Single-step selection. Several cell division 
plus mutation cycles and a single final selection step before terminal 
differentiation into a plasma or memory cell. (B) Multiple-step selection. 



Several alternating rounds of cell division plus mutation and selection 
[corresponding to several rounds of (A)], followed by terminal 
differentiation into a plasma or memory cell after the last selection step. 
Vertical bars indicate mutations. 



because the investigated trees were small. 
In addition to global measures not always 
being helpful in pointing to mechanisms 
at the micro-evolutionary scale (15, 23, 
24), simulation of global measures like 
peak total GC B cell numbers did not lead 
to results that contradict the single-step 
hypothesis (22). 

Summing up, the null-hypothesis has 
never been falsified by examining the 
shapes of phylogenetic trees. 

FALSIFYING THE NULL-HYPOTHESIS BY 
COUNTING RECURRENT MUTATIONS 

Let us consider a thought experiment, in 
which two syngeneic mice with a single, 
non-mutated B cell clone expressing the 
same V genes, are immunized with the 
same antigen, and that both mice initiate 
a process of AAM in which the selection 
forces acting on the diversified V sequences 
are identical. Let us further assume that 
the baseline mutability during SHM is 
uniformly distributed along rearranged V 
genes and independent on the time elapsed 
after immunization. After a number of 
days, a sample of Ab V genes is sequenced 



and an independent set of V sequences is 
obtained for each mouse. As a result of 
the stochastic nature of SHM, the muta- 
tion distribution in both sets may be quite 
different. If nevertheless an identical set 
of mutations appears in both independent 
data sets, we call it a recurrent mutation 
pattern. 

How likely is it to find a recurrent 
mutation pattern? Assuming that the AAM 
process in our mice followed a single-step 
selection scheme (Figure 1A), we can make 
a first rough estimation. Consider the prob- 
ability p£ to obtain a particular pattern 
of k mutations out of all possible pat- 
terns of k mutations, which is pf = -K, 

with = 3 k x (f) being the number of 
possible mutation patterns of size k, and 
L being the V sequence length. When an 
Ab V gene of length L = 300 and k muta- 
tions is produced by the SHM process, the 
probability that the outcome is a particu- 
lar mutation pattern is pf 30 ~ 10~ 3 for 
it = 1, and pf 0 « 10~ 6 for k = 2. During 
an AAM process, thousands of mutated 
B cells are generated in a mouse; hence, 
the probability of finding a given mutation 



pattern of size k among all mutated B cells 

is 1 — (1 — pl°°) N , where N is the number 
of B cells with k mutations. Let us assume 
that N= 10 5 B cells got k = 2 mutations. 
Then, as a crude estimation, the proba- 
bility of a given mutation pattern of size 
k = 1 and k = 2 among all those B cells 

is, respectively, 1 — (1 — p 300 ) 10 ~ 1 and 

1 - (1 - pf 0 ) 105 « 0.1. This means that 
recurrent mutation patterns of such a small 
size are rather likely to appear in a single- 
step setting, and are therefore not suitable 
to contradict the null-hypothesis. 

However, for k = 5, the probability of 
obtaining a particular mutation pattern 
by chance among 10 5 B cells is only 1 — 

(1 - pf 00 ) 10 ' fa 2 x 10" 8 . Hence it is highly 
improbable that both mice in our thought 
experiment could produce, by a single-step 
process, the same recurrent mutation pat- 
tern. On the other hand, in a multiple-step 
selection process, single mutations can be 
selected one by one (Figure IB). There- 
fore, finding recurrent mutation patterns of 
that size or larger would be consistent with 
a multiple-step scheme while deeming the 
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single-step null-hypothesis highly improb- 
able. Admittedly, the above is a simpli- 
fied probability calculation. A more realis- 
tic estimation, based on calculations that 
include reversions and different baseline 
mutabilities, does not change the above 
conclusions (see Supplementary Material). 

Data from experiments along the main 
idea of our thought experiment do exist. 
For instance, Ragl~'~ double transgenic 
mice for Ab H and L chains are available 
(25). Also, hapten-conjugated proteins can 
yield a large percentage of canonical V gene 
sequences (3). 

In a survey, we found a number of 
publications that present Ab V sequences 
obtained from syngeneic mice under the 
same immunization protocol (3, 4, 26-31). 
In all the data analyzed so far we could 
not find a single instance of mutated V 
sequences from GC B cells sharing three or 
more mutations. Also, a substantial set of 
independent murine Vjj genes with a com- 
mon Vr germline sequence was recently 
collected from literature and examined for 
recurrent patterns (32). The search yielded 
not a single case of shared triplets. 

In summary, to our knowledge, there is 
no published independent sequence data 
that contradicts the single-step hypothesis. 

CHALLENGE FOR FUTURE RESEARCH: 
TRYING TO FALSIFY THE SINGLE-STEP 
HYPOTHESIS WITH 
HIGH-THROUGHPUT SEQUENCE DATA 

A clear understanding of AAM requires 
answering the question whether the single- 
step or the multiple-step selection hypoth- 
esis hold. A straightforward approach 
would be direct observation of SHM and 
Ab-mediated selection events via in vivo 
imaging, but this is technically not yet 
possible. Similarly daunting is to try to 
infer the frequency of selection steps from 
indirect observations while making use of 
non-validated assumptions. 

Our proposal of falsifying the single- 
step null-hypothesis provides a way out. 
This hypothesis does not preclude the 
knowledge of any mechanisms besides the 
stochastic process of SHM. Moreover, this 
knowledge does not need to be highly 
precise because an upper estimation of 
probabilities under the null-hypothesis can 
suffice. 

Therefore, examining Ab V gene 
sequence data with the aim of falsifying 



the single-step hypothesis is a powerful 
technique. 

Next generation sequencing currently 
allows to obtain suitable Ab V gene 
sequence data (33, 34). One strategy for 
the search of contradictions is to calculate, 
under the null-hypothesis, the probability 
distribution of recurrent mutation patterns 
acquired independently. 

A detailed calculation of probability dis- 
tributions is shown in the Supplementary 
Material. It allows to estimate that, for given 
realistic parameters (see Table therein), the 
probabilities of observing recurrent pat- 
terns are much lower than 0.05. In case of a 
very strict multiple-step selection, the null- 
hypothesis can potentially be contradicted 
with very few sequences. 

This strategy can be pursued both for 
independent and same-lineage V gene 
sequence sets. In the latter case, the proba- 
bility calculation must be performed exclu- 
sively for recurrent patterns that cannot 
possibly stem from common ancestors. 

Another strategy comprises trying to 
contradict the null-hypothesis by exam- 
ining the structure of a same-lineage 
V sequence population for signs of a 
directed multi-step process as in contrast 
to an undirected, random process. Such 
signs can be, for instance, the emergence 
of independent quasi-species (35), or of 
coalescence times typical to multi-step 
processes (36). 

These methods require however: (i) that 
the AAM process has been ongoing long 
enough for population structures to have 
emerged, and (ii) that enough sequences 
can be retrieved to make these structures 
visible. It is well possible that times are too 
short and clonal sizes too small to provide 
this sort of data. 

No matter which strategy turns out 
to be the best, important challenges are 
still open. For instance, present meth- 
ods of calculating the pairwise probability 
that sequences pertain to a common or 
to a different B cell lineage (32) need to 
be improved, especially where short junc- 
tional regions make identification of lin- 
eage difficult. With such an analysis work- 
ing, different independent Ab V sequence 
sets can also be retrieved from the same 
individual. A further challenge consists 
of devising estimators of the recurrent 
mutation pattern probability distributions 
adequate to the respective experimental 



setup. Good estimations of baseline muta- 
bility would be helpful; however, using 
upper estimations of probability might be 
sufficient. 

For pinning down the actual AAM 
process, it is not advisable to examine 
data sets that include sequences of mem- 
ory cells, to avoid the risk of analyzing 
repeated rounds of immunizations against 
the same or different antigens. Thus, the 
design of experiments that consider both 
the anatomical compartment from which 
B cells are taken and strategies that maxi- 
mize the size of data sets, poses a challenge 
as well. 

While multiple-step selection points to 
AAM as an accelerated molecular evolution 
process maximizing Ab affinity increase, 
single-step selection points at an optimiza- 
tion process of Ab repertoires in which both 
Ab affinity enhancement and diversifica- 
tion can be equally relevant (14, 17, 37). 
Striving to discover which is right must 
be a priority to those interested in unveil- 
ing AAM mechanisms. Trying to falsify the 
single-step hypothesis is not easy and might 
be even impossible - for instance, if the 
underlying process is indeed a single-step 
one. But it is, in our opinion, the only 
viable way. 
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